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having  the  following  common  structure:  Player  1  (Emitter)  produces  a 
binary  periodic  sequence;  Player  2  (Predictor)  observes  some  initial 
segment  of  this  sequence  and  then  tries  to  predict  the  next  digit.  The 
payoff  to  Emitter  is  zero  if  the  prediction  is  a  correct  one.  The  games 
differ  in  additional  assumptions — those  are  in  particular:  (±0  Predictor 
required  to  make  his  prediction  after  observing  a  prescribed  number  of 
digits  of  the  sequence)  (2)  Predictor  allowed  to  observe  any  number  of 
digits  but  earning  a  decreasing  amount  for  each  correct  prediction  ae 
the  number  increases^  (-3)  The  period  of  the  emitted  sequence  being 
chosen  by  random  from  some  fixed  distribution)  (A)  Emitter  allowed  to 
choose  the  period  but  being  paid  a  decreasing  amount  for  incorrect  pre- 
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sum  and  two  nonzero-sum  games  are  obtained.  It  is  shown  that  alJ  these 
games  possess  a  solution,  some  are  at  least  partially  solved  and  their 
further  properties  investigated. 
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1.  INTRODUCTION. 

In  this  paper  we  study  certain  types  of  infinite  games  which 
belong  to  the  class  of  discrete  emission-prediction  games,  also 
known  as  discrete  games  of  aiming  and  evasion.  An  emission-pre¬ 
diction  game  is  a  two-person  game,  where  the  first  player,  to  be 
named  Emitter,  produces  a  sequence  of  elements  from  some  fixed 
finite  set  A.  The  second  player,  to  be  named  Predictor,  is  allowed 
to  observe  the  sequence  for  some  time  and  is  then  required  to  make 
a  prediction  of  some  kind  about  the  future  behavior  of  the  sequence. 
The  general  Predictor's  goal  is  to  make  a  correct  prediction,  while 
Emitter  wants  to  avoid  this. 

Emission-prediction  games  may  serve  as  models  for  a  variety 
of  conflict  situations.  For  instance,  Emitter  could  be  identified 
as  an  attacker  (bomber,  submarine,  guided  missile,  guerilla 
unit)  performing  a  series  of  evasive  maneuvers,  which, in  the  dis¬ 
crete  time,  represent  the  sequence  being  emitted.  Predictor  is 
trying  to  destroy  the  attacker.  Since,  in  general,  the  attacker 
can  be  Intercepted  only  after  one  or  more  time  units  have  elapsed 
from  the  last  observed  maneuver  (or  position) ,  Predictor  must  be 
able  to  correctly  predict  future  maneuvers  (positions)  of  the 
attacker. 

To  our  best  knowledge  the  study  of  emission-prediction  games 


was  initiated  by  Isaacs  (the  "bomber  and  battleship"  problem 

and  have  since  been  investigated  by  Dubins  [2],  Isaacs  and  Karlin  [5], 
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Isaacs  [4],  Karlin  [6],  Blackwell  [1],  Ferguson  [3]  and  Macula  [7]. 
The  assumptions  made  by  these  authors  can  be  summarized  as  follows: 

(1)  No  restriction  is  placed  upon  the  emitted  sequence. 

(2)  Predictor  is  allowed  to  observe  the  sequence  as  long  as  he 
wishes  before  he  decides  to  make  a  prediction  concerning 
several  subsequent  terms  of  the  sequence. 

(3)  The  game  is  a  zero-sum  game  with  the  payoff  depending  only 
on  the  discrepancy  between  the  prediction  and  the  actual 
values . 

The  emission-prediction  games  investigated  in  this  paper 
differ  mainly  in  one  crucial  aspect;  we  assume  that  the  emitted 
sequence  is  a  periodic  one.  Apart  from  this  being  of  interest  per 
se  we  have  been  motivated  by  the  following  idea.  Suppose  that  the 
attacker  is  a  simple  automatic  device  with  built-in  preprogrammed 
ability  to  perform  evasive  maneuvers.  The  trajectory  of  this 
device  will  then  follow  a  periodic  pattern  with  some  period  depend¬ 
ing  upon  the  complexity  of  the  program  and  unknown  to  Predictor— 
the  defender.  Even  in  situations  where  the  attacker  is  controlled 
by  a  human  operator  periodicity  may  very  well  serve  as  a  first 
approximation.  We  believe  that  most  humans  do  exhibit  a  kind  of 
cyclic  pattern  when  asked  to  perform  a  series  of  evasive,  i.e. 
unpredictable  maneuvers. 

Assuming  periodicity  of  the  emitted  sequence,  however, 
compels  us  to  place  some  additional  restrictions  on  Emitter  and/or 


~*ii'  ••  ***• .  - 
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Predictor  since  otherwise  the  game  would  not  possess  a  solution. 

The  additional  restrictions  considered  in  this  paper  are:  the 
period  is  chosen  by  an  independent  chance  mechanism,  bounds  on  the 
length  of  observation  intervals,  and  penalties  for  long  periods  and 
lengthy  observation.  As  for  the  type  of  prediction  required  we 
limited  outselves  to  the  simplest  case  of  predicting  the  next  term 
of  the  sequence.  (With  the  periodicity  assumptions  this  is  by  no 
means  trivial  as  it  would  be  without  it.)  For  the  sake  of  simpli¬ 
city  of  notation  we  consider  binary  sequences  only;  most  of  the 
results  of  this  paper  extend  easily  to  the  m-ary  case. 

Another  possible  application  of  the  model  may  be  that  of 
an  optimum  jamming  strategy  for  a  missile  defense  system.  Suppose 
that  a  ship  is  being  attacked  by  a  missile  equipped  with  a  target 
search  radar.  To  avoid  detection  the  search  radar  is  not  active 
all  the  time  but  rather  is  constantly  being  switched  on  and  off  in 
some  programmed  pattern.  The  intensity  of  illumination  of  the  tar¬ 
get  by  the  search  radar  then  follows  a  binary  periodic  sequence 
(in  a  discrete  time).  The  defender  of  the  target  (ship)  wants  to 
jam  the  search  radar.  The  jamming  device,  however,  must  not  be 
transmitting  all  the  time  since  the  missile  could  then  home  on  the 
transmitter  and  hence  the  target.  Thus,  the  defender  faces  the 
problem  of  predicting  the  next  mode  of  operation  of  the  search 
radar  after  observing  the  illumination  pattern  for  some  limited 
time,  i.e.,  predicting  next  term  of  the  binary  sequence. 
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2.  PRELIMINARIES. 

Throughout  this  paper  the  symbol  A  will  denote  the  two- 
element  set  A  -  {0,1}  and  the  symbol  0  with  generic  elements 
u  will  denote  the  set  of  all  periodic  sequences  of  zeros  and  ones. 
We  say  that  a  binary  sequence 

u>  ■  (u).,u -,...)  ;  w  €  A;  n  *  1,2,...; 
i  i.  n 

is  periodic  with  the  period  ir(u)  ■  t;  t  ■  1,2,...,  if 

wn+kt  "  wn  ^°r  evtry  n  *  1,2,...;  k  «  1,2,...  . 

Obviously,  each  periodic  sequence  m  €  0  has  only  one  period,  and 
we  denote 

flt  -  {w  €  0  :  Tr(ui)  -  t} 

so  that  0  is  a  disjoint  union  of  . 

If  a )  €  0  and  n  ■  1,2,...  then  u>n  will  denote  the 
n-th  term  and  w|n  the  ordered  n-tuple  of  the  first  n  terms 
of  the  sequence  <d. 

Let  An  be  the  set  of  all  ordered  n-tuples  of  zeros  and 

ones.  If  a  ■  (a. , . . .  ,a  )  €  An  and  3  €  A  then  a3  is  the 

l  n 

00 

ordered  (n+1) -tuple  (a,,..., a  ,3)  and  (a)  is  the  periodic 

l  n 

sequence 

00 

(i)  *  (ot_  $•••$&  !•••)•  ClcAtly f  it ( (fl)  )  i  n» 

x  n  i  n 
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For  every  a  €  AB  and  t  ■  1,2*...  we  define  the  function 

(1  if  there  is  u  €  Qt  such  that  u>  ■  (a)", 

(1) 

0  otherwise , 

and  the  function 

i 

A(a>  -  min{t  -  1,2,...  :  It(a)  -  1}.  (2) 

Thus  I  (a)  serves  as  an  indicator  of  "potential  periods"  of  a 
and  A  (a)  la  the  "smallest  potential  period"  of  a. 

Later  we  will  need  the  following  lemma. 


Lemma  2.1: 
X(M)  >  fe 
number  x. 


For  every  a  €  An  there  exists  a  6  A  such  that 
+  1,  where  [xj  denotes  the  integral  part  of  the 


Proof:  For  any  n-tuple  a«  (o^,...,an>  denote 

T(a)  -  {t  •  l,...,n-l  :  I^Ca)  -  1}  if  n  >  1 
and  T(a)  ■  0  if  n  ■  1.  Notice  first  that 

T(aO)  n  T(al)  -  * 

i 

and  (3) 

T(aO)  U  T(al)  -  T(a)  U  {n>. 

This  follows  by  realizing  that  if  t  €  T(a)  then  a  ■  (a^,.*.,a£; 
o^,...,at;...;o^,...,o^),  1  £  i  £  t  so  that  t  €  T(ao^) 


l 
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*i  i 


if  1  <  t  and  t  €  T(aa^)  if  i  -  t.  Conversely,  if  t  6  T(a<*i+^) 
and  t  <  n  then  t  €  T(a)  and  clearly  t  £  T(ao)  for  a  j*  a. , , . 


i+1 


Finally,  n  €  T(aa)  if  and  only  if  a  ft  a^. 


Let  now  a  -  (a. , . . .  ,a  )  and  assume  n  >  2  since  the 
i  ,n 

lemma. is  trivial  for  n  ■  1,2.  If  T(a)  -  0  then  by  (3)  T(aO) 

i 

U  T(al)  ■  {n>  so  that  A(ao)  St  n  for  any  a  €  A. 

.  *  ,  j 

If  T(a)  i  0  then  there  is  b  -  (e^,.. .,8^)  and 

, 

c '■  (Y-, » • . « »Y#)  such  that 

■  1  ^  i  •'  * 

:  I  •  * 

i  •  . 

a  -  (be  be  .1.  be  b)  and  T(bc)  ■  0. 

1  nT  times  ^ 

'  \  . 

,  ;  |  t 

If  T(b)  -  0  then  T(a)  -  {k  +  4}  and  by  (3)  X(ao)  i  n  for  some 
a  €  A. 

.  !  i  !  ! 

If  T(b)  i  0  then  there  is  e  -  )  and  d  -  (6, . 6  ) 

i  r  x  8 

\  *  j 

such  that 


b  -  (ed  ed  ...  ed  e)  and  T(ed)  ■  0. 


Hence  taking  o  ,0  ■  8^  we  have 


X(aa)  2  m(lcK)  +r+sik+i+2"n-k+2 


and  since  2k  £  n  -  1  we  obtain 


A(aa)  *  +  2  > 


+  1.  The  lemma  is  proved. 
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3.  ZERO-SUM  GAMES  WITH  RANDOM  PERIOD. 

In  this  section  we  consider  the  case  where 

(1)  the  period  t  of  Emitter's  sequence  is  chosen  randomly 
according  to  some  distribution  v  known  to  both  players, 

(2)  Predictor  is  allowed  to  observe  the  first  n  digits  of 
the  Emitter's  sequence  and  tries  to  predict  the  next 
(n+l)st  digit, 

(3)  the  payoff  to  Emitter  is  zero  if  the  prediction  is  correct 
and  one  otherwise. 

In  other  words,  the  expected  payoff  is  the  probability  of 
wrong  prediction,  which  the  Emitter  is  trying  to  maximize  and  the 
Predictor  to  minimize. 

We  will  denote  the  resulting  zero-sum  game  by  G(v,n), 
where  v(t),  t  ■  1,2,...  is  the  period  distribution, 

00 

v(t)  iO,  t  -  1,2,...  ;  l  v(t)  ■  1 

t-1 

and  n  *  1,2,...  is  the  length  of  the  sequence  observed  by  the 
Predictor. 

Next  we  describe  Emitter's  and  Predictor's  pure  strategies. 
To  specify  a  pure  strategy  Emitter  must  choose  for  every 
t  ■  1,2,...  a  binary  sequence  with  the  period  t.  Emitter's  pure 
strategies  s  are  therefore  sequences 

8  ■  { s^ ,  t  ■  1,2,...},  where  st  € 
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The  set  S  of  all  Emitter's  pure  strategies  Is  then  an  infinite 
Cartesian  product  S  ■  x  ^  x  ...  of  finite  sets  and  is 
therefore  uncountable. 

Predictor's  pure  strategies  4  are  simpler  to  describe. 

Each  $  is  a  mapping  of  An  into  A  ■  {0,1}.  For  each  a  €  An, 

♦(a)  is  the  prediction  of  the  (nfl)st  term  of  the  Emitter's 

sequence  if  the  first  n  terms  observed  are  the  n-tuple  a.  The 

set  ♦  of  all  Predictor's  pure  strategies  is  therefore  a  finite 
2n 

set  of  2  elements. 

The  payoff  function  W(s,+)  is  then  given  by 


W(s,*)  -  l  v(t)  W  (s,<fr)  ;  stS,  <fr  €  *, 
t-1  c 


where  for 


t  -  1,2, .. . 


Wt(s,«>) 


and  «  ■ 

'1  if  <t>(w|n)  t  “n+i* 

'0  if  (w|n)  “ 


It  is  easy  to  see  that  W((s,4)  is  nothing  but  conditional  proba¬ 
bility  of  wrong  pi  diction  given  that  the  period  is  t  and  hence 
W(s,$)  is  the  unconditional  error  probability. 

The  game  G(v,n)  is  now  formally  defined  by  the  triplet 
(S,4,W).  This  is  an  infinite  zero-sum  game,  however,  since  the  set 
<t>  is  finite  and  the  layoff  function  is  nonnegative  and  bounded  by 
one  we  have  by  the  well-known  theorem  of  Wald  [8]  the  following 


proposition. 
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Proposition  3.1:  The  game  G(v,n)  has  a  value  and  Predictor  has 
an  optimal  strategy. 

Let  us  turn  our  attention  to  the  existence  of  Emitter's 
optimal  strategy.  By  another  Wald's  theorem  ([8],  Theorem  2.20) 
an  optimal  strategy  exists  if  the  corresponding  space  of  pure  strate¬ 
gies  is  totally  bounded  with  respect  to  the  intrinsic  metric 
([8],  sec.  2.1).  Nov  the  space  S  of  Emitter's  pure  strategies 
is  totally  bounded  in  the  above  sense  if  for  every  e  >  0  there 

exists  a  finite  subset  S  of  S  such  that  for  every  s  6  S 

€ 

there  is  an  r  6  S  for  which 

e 

max] W(s,+)  -  W(r,t) |  <  e. 

♦€* 

It  is  easy  to  see  that  this  condition  is  satisfied  for  the  game 

G(v,n);  take  an  integer  T  such  that  ][  v(t)  <  e  and  define 

t>T 

S  to  be  the  set  of  all  pure  strategies  s  -  {st,  t  -  1,2,...} 
with  st  fixed  for  t  >  T.  Thus  we  have  proved  the  following 
proposition. 

Proposition  3.2:  In  the  game  G(v,n)  Emitter  has  also  an  optimal 
strategy. 

In  general,  optimal  strategies  will  be  mixed  strategies, 
that  is  probability  distributions  on  the  corresponding  spaces  of 
pure  strategies. 

Let  X  be  the  space  of  Emitter's  mixed  strategies.  Since 
S  is  a  product  space  a  mixed  strategy  x  €  X  will  be  a  product 
probability  measure 
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x  ■  x1  X  x2  x  ,,,  , 

where  xf  Is  a  probability  distribution  on  the  finite  set 
In  other  words,  x  6  X  is  completely  specified  by  a  sequence  of 

probability  distributions  x£;  t  ■  1,2 .  The  payoff  function 

W  then  extends  onto  X  x  ♦  by 


W(x,$)  -  |  W(s,+)dx(s) 
S 


i  l  v(t)W  (s,$)dx(8) 
S  t-1 


08  (  • 

-  I  v(t)  W  (s,*)dx(s)  -  l  v(t)  l  W  (s,+)x  (s  ). 
t-1  '  t  t-1  st6Jt  *  *  * 

Predictor's  mixed  strategies  y  €  Y  are  again  probability 
distributions  on  the  finite  set  ♦  .  Thus  for  every  such  y  and 
s  6  S  we  can  define 

<0 

W(s,y)  ■  l  W(s,*)y($)  -  £  v(t)  [  W  (s,*)y($). 

♦€♦  t-1  ♦€♦ 

Let  y  €  V  and  let  for  every  a  6  An,  o  6(0,1}  f  (a)  -  £  y($) 

a  {«>  :$(a)-a} 

Clearly,  f^Ca)  the  probability  of  predicting  a  if  the 
n-tuple  a  has  been  observed  and  if  Predictor  uses  the  mixed 
strategy  y.  Thus  f  -  (fg,fj)  could  be  called  Predictor's 
behavioral  strategy  and  we  have  for  u  ■  s^. 
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W  <s,f)  -  l  W  (s^)y(O  - 

rfofel“> 

if 

Vi  *  1> 

(4) 

if 

Vi  ■  °- 

Conversely,  if  f  «  (f^.f^)  where 

f  maps 

An 

into  [0,1] 

and 

fg(a)  +  f^(a)  -  1  for  all  a  €  An 

then 

y(*)  -  I  f0(a)  + 

l  £iU) 

> 

♦  €  *, 

{a€An:$(a)-0}  {a€An:$(a)-l} 


la  a  mixed  strategy  and  (4)  holds.  Hence  we  can  work  with  the  set 
F  of  Predictor's  behavioral  strategies  instead  of  with  the  set  Y. 


Proposition  3.3:  The  value  v  of  G(v,n)  is  given  by 


v  -  min  l  v(t)  max  max{I  (aO)f.(a),I  (al)fn(a)}  (5) 

f€F  t-1  a€An  c  l  t  u 


and  f  is  Predictor's  optimal  strategy  if  and  only  if  it  minimizes 
the  series  above. 


Proof:  Since  G(v,n)  has  a  value  we  must  have  v 
Now 


min  sup  U(s»f). 
f€F  s£S 


sup  W(s,f)  ■  l  v ( t)  max  W  (s,f)  , 
s€S  t-1  st6Jt 


max 

8t€flt 


Wt(s,f) 


max{f.  (a) } , 
i-a 


and  by  (4) 
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where  the  latter  maximum  is  taken  over  the  set  of  all  (rrt-1) -tuples 
aa  €  A0*1  for  which  w]”*1  ■  aa  for  some  w  €  flt»  i.e.  for 
which  It(aa)  ■  1.  Hence 

max  W  (s,f)  -  max  max{I  (aO)f.(a),I  (al)fft(a)} 
st€fit  a€A  t  i  t  u 

and  the  second  assertion  follows  from  the  existence  of  Predictor's 
optimal  strategy. 

Setting  f(a)  -  for  each  a  €  An  in  (5)  we  obtain 

Corollary:  0  £  v  < 

* 

Proposition  3.4:  There  exists  Predictor's  optimal  strategy  f 

such  that  for  each  a  €  An  either  f  (a)  •  (1,0)  or  f  (a)  »  (0,1) 

*  x  A  1% 
or  f  (a)  -  (^,y). 

Proof :  Let  for  every  t  -  l,...,n+2  and  a  €  An 

B(t,a)  -  Uq  :  fQ(a)  ■  «axn  max{I  (a0)f1(a)  ,It(al)f0(a) }} , 

a€A 

where  f^  denotes  the  vector  with  2n  components  fg(a).  Clearly, 
B(t,a)  is  a  convex  polyhedron  contained  in  the  2n-dimensional 
unit  hypercube.  Furthermore,  the  vertices  of  B(t,a)  are  vectors 
with  components  0  or  1  or  1/2  only. 

Let  u  ■  (a^, . . .  »an+2) »  €  An  and  let 

n+1 

W  ■  I  v<«=)£  U  )  +  f  (.  )  I  V(t). 

t-1  t>n+l 


13 


Mow  L  it  a  linaar  function  of  f-  and  hence  its  minimum  over 
u  ~0 

the  convex  polyhedron 


n+2 

B  -  H  B(t,a  ) 
t-1 

is  attained  at  one  of  the  vertices  of  B^.  Denoting  f|j  the 
vertex  at  which  the  minimum  is  attained  ve  see  that  for  each  a  6  An 

f|J(a)  -  0  or  1/2  or  1.  (6) 

Finally,  since  It<ao)  -  1  for  t  >  n  +  1, 


min  l  v(t)  max  m*x{I  (aOMaM  (al)fft(a)} 
f  t-1  a6An  t  i  t  u 


where 


u* 


minimizes 


It  follows  from  Proposition  3  that  fu*  -  (fg*,l-fg*)  is  an 
optimal  strategy  which  together  with  (6)  terminates  the  proof. 

Proposition  3.5:  Let  £  v(t)  >  Then  v  -  and 
x  x  t>nfl  1  1 

f(a)  -  (y,— ) ,  a  €  A  is  the  unique  Predictor's  optimal  strategy. 

* 

Proof :  Let  f  be  an  optimal  strategy  and  assume  that  for  some 
a  €  An,  o  €  {0,1}  f* (a)  + 


It  follows  from  the  proof  of 
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Proposition  3.4  that  there  must  be  another  optimal  strategy  with 
fQ(a)  ■  0  or  1.  Hence  assume  that  fa(a)  ■  1  (otherwise  take 
fl^OO).  Then 


I  v(t)  max max{I  (aO)f*(a),I  (al)f*(a)} 
t-1  a€An  t  l  t  o 


A  A  1 

*  waxn  maxU^aJ.f^a)}  l  v(t)  >  -r 

a£A  t>n+l 

so  that  v  >  which  is  a  contradiction.  Hence  f*(a)  -  for 
z  a  l 

all  a  6  An,  a  €  (0,1).  For  this  f* 


v-  l  v(t)  \  max max{I  (aO),I  (al)}  -  \ 
t-1  c  a€A  z  z 

since  for  every  t  -  1,2,...  there  exists  a  6  An  and  a  6  (0,1} 
such  that  It(a  )  ■  !• 

Proposition  3.6:  Let  the  distribution  v  be  such  that 

«((§]+ 1)>  l  v(t),  (7) 

t>  [f  ]+l 

where  [^J  is  the  integral  part  of  y.  Then 

I  v(t) 

t>[f]+l 


v 


(8) 
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and  the  Predictor's  behavioral  strategy  £  defined  by 


XUa)  £  [f]  +  1  -  f*(a)  -  0 


(9) 


is  an  optimal  strategy. 


n  |i  1 1 

Proof:  Let  us  sssubm  first  that  n  is  odd  so  that  [-|j  +  1  *  — j-< 

n  r>|1 

Let  e  €  A  be  the  n-tuple  which  has  1  at  the  t=jA)th  place 


and  zeros  elsewhere.  Clearly 


n+1 


I^CeO)  *  1  if  an  only  if  t  >  — j-, 


and 


It(el) 


n+1 


1  if  and  only  if  either  t  »  — j-  or  t  >  n  +  1 


Hence 


l  v(t)  maxn  max{It(aO)f1(a),It(al)f0(a)} 
t-1  a€A  “ 


2  l  v (t)  max{I  (eO)f . (e) ,1  (el)f-(e) } 
t-1  c  l  c  u 


vf5—)  fQ(e)  +  f^e)  l  v(t)  +  max{f1(e),f()(e)}  J  v(t) 


n+1 


:  tfin+1 


t>n+l 


l  v(t)  by  the  hypothesis.  Thus  v  k  £  v(t). 


n+1 


n+1 


Next  let  f  be  the  strategy  (9).  Then  by  Lemma  2.1 


i 
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vi  I  v(t)  max  max{I  (aO)f*(a),I  (al)f*(a)} 
t-1  a€A  c  j.  t  u 

-  I  v(t)  max  max{I  (aO)f* (a) ,1  (al)f*(a)} 

n+1  a€A  c  1  t  u 

C  2 

■  l  v(t). 

_  n+1 


Hence  v  ■  £  v(t)  and  f  Is  an  optimal  strategy. 


If  n  is  even  we  begin  with  e  having  1  at  the  (yH)s t  place. 

The  rest  follows  verbatim. 

If  the  hypothesis  of  the  above  proposition  is  not  satisfied 
the  game  G(v,n)  is  very  difficult  to  solve.  As  seen  from  Propo¬ 
sition  3.3,  this  amounts  to  solving  the  nonlinear  programming 
problem: 

00 

"minimize  T  v(t)  max  max{I  (aO)f . (a) ,1  (al)fn(a)}  (10) 

^  -l  ^ .  n  t  l  t  u 

t-1  a€A 

subject  to  f  €  F." 

It  is  true  that  it  can  be  solved  by  breaking  it  into  a  number  of  linear 
programming  problems  as  we  did  in  the  proof  of  Proposition  3.4  but 
the  size  of  this  task  is  still  formidable.  A  slight  simplification, 

7 

however,  can  be  obtained  by  considering  truncated  versions  of 
G(v,n). 
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Let  T  -  1,2 .  let  G^(v,n)  be  the  game  (S,*,WT) 

obtained  from  G(v,n)  by  truncating  the  distribution  v  at  T, 
that  Is  setting  v(t)  -  0  for  t  >  T  and  normalizing  to  one. 

T  T  T 

Proposition  3.7:  Let  v  ,  f  and  x  be  the  value  and  optimal 

i 

strategies  respectively  of  the  truncated  game  G^(v,n).  Then,  as 
T  - 


where  v,  f 
G(v,n) . 

Proof:  Since 


T 

v 


and 


X  *  T  * 

v,  f  -►  f  and  x  -*■  x  , 

* 

x  are  the  value  and  optimal  strategies  of 


v  Is  a  probability  distribution  and 


|VT(x,f)  -  W(x,f)|  £  l  v(t) 

t>T 


T 

U  (x,f)  converges  to  U(x,f)  uniformly  in  both  f  and  x,  which 
implies  the  statement. 

To  the  end  of  this  section  we  present  a  solution  of  a  small  game 
G(v,n) . 

Example :  Let  n  *  5,  v(t)  >  0  for  t  -  1,...,5  and  v(t)  ■  0 
for  t  >  5.  To  compute  the  right-hand  side  of  (5)  we  need  the 
values  of  I^(aa)  for  t  ■  1,...,5,  a  €A^,  a  ■  0,1.  These 
are  given  in  the  first  two  columns  of  Table  1.  Substituting  into 
(10)  we  obtain 


I 


I 


■  .  ,  1  18  . 

'  .  •  j  f 

I  , 

I 

i  I  1 

vfDf^ap  +  v(2)fQ(a11)  +  \K3)max{fQ(a5)  .f^a^)  ,fQ(a14) } 

t 

i 

+  v(4)max{f1(a3)  .fjUj)  »f1<a7>  »fo^a9^  »f0*a13^f0*a15^ 

i 

+  v(5)max{f.  (a.) ;  i  -  2,..., 16}  , 

1  1  i  1 

i 

+  v(l)f0(a32)  +  v(2)f1(a22)  +  v(3)max{f1(a28>  ,fQ(a23)  .f^a^)  >, 

,+  v(4)max{fo(a3O),fQ(a28^»fo^a26^*fl^a24^,fl^a20^*fl^a18^ 

+  v^maxU^a^:  i  -  17,..., 31}.  .  *  (11) 

•  I 

Looking  at  the  expression  (11)  as  a  function  of  64  nonnegative 

variables  !  fj(a^)  constrained  to  satisfy  fg(aj)  +  ^(a^)  “  1  we 

(  , 

see  that  the  only  variables  that  make  the  minimization  difficult 
are  those  for  which  both  f^a^)  and  f^(aj)  appear  in  (11). 

Hence  it  is  easy  to  conclude  that  the  slinimum  of  (11) ,  the  value 
v  of  the  game,  is  equal  to  the  minimum  of  ' 

v(2)u^  +  v(3)u2  +  v(4)max{u3,l-u2)  +  v(4)max{l-u^,l-u2,l-u3}  (12) 

:  1 

i 

over  all  u^,u2»u3  satisfying  0  £  u^  £  1,  j  ■  1,2,3. 

The  minimum  of  (12)  is  found  to  be  the  smallest  of  the  four  numbers: 
v(4)  +  v(5) ,  v(3)  +  v(5),  v(2)  +  v(3)  +  v(4),  -|(v(2)+v(3)+v(4)+v(5)) . 

f  fc 

Hence,  the  value  v  of  the  game  is: 

i  ■  i  '  1 

Case  1:  v(4)  £  v(3)  and  v(4)  +  v(5)  £  v(2)  +  v(3) 

i 

v  ■  v (4)  +  v(5) 


Case  2:  v(3)  £  v(4)  and  v(3)  +  v(5)  £  v(2)  +  v(4) 

v  -  v(3)  +  v(5) 

Case  3:  v(2)  +  v(3)  +  v(4)  £  v(5) 

v  -  v(2)  +  v(3)  +  v(4) 

Case  4;  v(2)  +  v(3)  -  v(5)  £  v(4)  £  v(3)  +  v(5)  -  v(2) 

V  -  |(v(2)+v(3)+v(4)+v(5))  -  i(l-v(l» 

Predictor's  optimal  strategies  for  each  of  the  four  cases  are  given 


in  Table  1. 
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4.  ZERO-SUM  GAMES  WITH  PENALTY  FOR  LONG  PERIODS. 

We  will  now  relax  the  restrictions  on  Emitter  and  let  him 
choose  freely  the  period  of  the  sequence  emitted.  We  will,  however 
impose  a  penalty  on  Emitter  for  choosing  too  large  periods.  The 
game  then  proceeds  as  follows: 

(1)  Emitter  chooses  a  period  t  ■  1,2,...  and  a  periodic 
binary  sequence  with  the  period  t. 

(2)  Predictor  is  allowed  to  observe  the  first  n  digits  of  that 
sequence  and  tries  to  predict  the  next  (irt-l)st  digit. 

(3)  The  payoff  to  Emitter  is  zero  if  the  prediction  is  correct 
and  is  equal  to  a  nonnegative  constant  c(t)  if  the  predic¬ 
tion  is  wrong  and  the  emitted  sequence  has  period  t. 

The  sequence  c(t),  t  -  1,2,...  represents  the  penalty  and  we 
assume  that  for  all  t  ■  1,2,... 

c(t)  c(t+l)  i  0  (13) 

To  avoid  the  trivial  case  we  assume  that  c(t)  >  0  for  at  least 
one  t . 

The  resulting  zero-sum  game  is  then  specified  by  the 
sequence  (13)  and  an  Integer  n  ■  1,2,...  .  We  will  denote  it  by 
G(c(t)  ,n) . 

The  space  S  of  Emitter's  pure  strategies  s  now  consists 
of  ordered  pairs 

s  ■  (t,st)  where  t  ■  1,2,...  and  st  € 

Hence  S  is  countably  infinite. 
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The  space  ♦  of  Predictor's  pure  strategies  as  same  as 
same  as  before. 

The  payoff  function  W(s,t)»  s  C  S,  $  £  ♦  is  for 
»  ■  (t,st),  s t  -  «  given  by 


W(s,*) 


r  c(t)  if  ♦(w|n)  # 
^  o  if  ♦(u|n)  - 


(14) 


Introducing  again  Predictor's  behavioral  strategies  f  we  have 
now 


W(s 


c(t)f0(w|n)  if  Wnfl  -  1, 
c(t)f1(w|n)  if  -  0. 


Since  the  space  4>  is  finite  and  by  (13)  the  payoff  function  is 
bounded  we  have 


Proposition  4.1:  The  game  G(c(t),n)  has  a  value  and  Predictor 
has  an  optimal  strategy. 

This  time,  however,  we  can  obtain  explicit  expressions. 
Proposition  4.2:  The  value  v  of  G(c(t),n)  is  given  by 


a€A 


u  cv.A\,a ujj 


(15) 


a  n 

and  the  strategy  f  defined  for  all  a  €  A  by 
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*,  v  c(X(aO)) _ 

£0(a)  "  c(X(aO))  +  c(X(al)) 

is  Predictor's  optimal  strategy. 


(16) 


Remark ;  Expressions  are  to  be  interpreted  as  0  in  (IS)  and 
as  any  number  x,  Oixil  in  (16). 


Proof :  Let  f  €  F. 

sup  W(s,r)  -  sup  { c (t )  max  max[I  (al)fft(a),I  (aO)f.(a)]} 
s€S  t-1,2, . . .  a€A  c  u 

-max  max{fn(a)  sup  c(t)I  (al),f.(a)  sup  c(t)I  (aO)). 
a€A  u  t-1,2, . . .  c  1  t-1,2, . . . 


From  (13)  for  a  -  0,1 


sup  c(t)I  (aa)  -  c(X(aa)). 
t-1,2,... 


Hence 


v  -  min  sup  W( s,r)  ■  max  h(a), 
f€F  s€S  a€An 


where 


h(a)  -  min  max  f  (a)c(X(aa))  .  (17) 

f (a)  a-0,1  “ 

However,  h(a)  is  ea'ily  recognized  to  be  the  value  of  the  two- 
by-two  matrix  game  with  the  matrix 


H(a) 


0  c(X(aO)f 

,c(X(al)) ,  0  __ 


so  that  (15)  and  (16)  hold  and  the  proposition  is  proved. 
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Let  us  now  turn  our  attention  to  Emitter's  strategies.  Since 
the  space  S  is  countable,  the  space  X  of  Emitter's  mixed  strat¬ 
egies  is  simply  the  set  of  all  probability  distributions  on  S. 

It  is,  however,  more  convenient  to  define  a  mixed  strategy  x  €  X 
as  the  pair 


x  -  (£,{yt}), 

where  £  ■  C^,^***'  is  a  probability  distribution  on  positive 

integers  and  u  are  distributions  on  finite  sets  ft  .  Clearly, 

%0  1* 

if  8  ■  (t,8t)  €  S  and  x  €  X  then  x(s)  -  yt(st>. 

*  *  *  n 

Proposition  4.3:  Let  a  ■  (<*-,...,<>)  6  A  be  such  that 

h(a*)  -  max  h(a),  (18) 

a€A 

where  h  is  defined  by  (17),  let  for  o  ■  0,1 

tQ  -  A  (a*a) , 

and 
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and 


**(st)  ■  | 


1  if  s  -  r  ,  t  -  t  ,  o  ■  0,1, 
t  t  o 

a 

arbitrary  otherwise, 


is  an  optimal  strategy. 


Proof:  Notice  first  that  since  by  Lemma  2.1 


max{X(aO)  ,X(al)}  > 


+  1 


the  (n+l)st  digits  of  the  two  sequences  ■  r  and  o>'  ■  r 

*0  1 

must  be  different.  Let  now  $  €  We  have 


W(x*,<fr)  -  [  W(s,$)x*(s)  -  C*  c(t0)  1 4  (a*)  -  w  .| 

s€S  *0  0+1 


+  C  c(t. )  | ♦(a*) 
C1  1 


cC^JcC^) 
c(tQ)  +  c(tx)  "  v* 


and  since  this  is  true  for  any  ♦  €  *,  x  is  an  optimal  strategy. 


Remark :  The  results  of  this  section  remain  true  even  if  the  time 
when  Predictor  is  required  to  make  a  prediction  is  allowed  to 
depend  on  the  observed  sequence  in  some  predetermined  fashion. 

For  instance,  we  may  ask  Predictor  to  predict  the  next  digit  as 
soon  as  he  observes  a  certain  number  k  of  ones  or  as  soon  as 
the  number  of  observed  digits  reaches  some  n,  whichever  occurs 


first. 
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The  space  *  of  Predictor's  pure  strategies  Is  then  the 
set  of  all  mappings  ^  from  a  finite  set  B  Into  {0,1},  where 
the  set  B  is  a  finite  collection  of  finite  strings  of  zeros  and 
ones  with  the  property  that  for  every  binary  sequence  u  €  Q 
there  is  a  string  b  €  B,  which  is  the  initial  segment  of  u. 

The  set  B  is  determined  by  the  requirement  imposed  upon 
the  prediction  time.  For  instance,  in  the  case  mentioned  above 
the  set  B  will  consist  of  all  strings  of  at  most  n  zeros  and  ones 
which  contain  at  most  k  zeros. 

The  only  modification  needed  in  order  to  extend  the  results 
of  this  section  to  this  more  general  case  is  to  replace  the  set 
An  by  the  set  B. 

Example :  Consider  the  game  G(c(t),n)  with  n  ■  5  and 

f  -|(6-t)  if  t  -  1,. .  .,6, 
c(t)  -  I 

l  0  if  t  >  6. 

To  solve  the  game  we  firpt  compute  X(a^a)  for  each  a^  €  A^, 
a  ■  0,1.  These  are  in  the  first  two  columns  of  Table  2.  Then 
compute  c(A(a^a))  for  (19)  and  h(a^)  from  (17)  -  see  Table  2. 

Hence  the  value  v  of  the  game  G(c(t),n)  is  v  ■  1.5 
and  Predictor '8  optimal  strategy  f  as  computed  from  (16)  is  in 
the  last  column  of  Table  2.  Emitter's  optimal  strategies  are  com¬ 
puted  from  Proposition  4.3.  We  obtain 
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C.  - 


.25 

if 

t  -  3, 

.75 

if 

t  -  5, 

0 

otherwise. 

and 


u*({ (Oil)", (100)"}) 


1,  w5({ (01101)  ,(10010)  »  -  1 


and  arbitrary  otherwise.  In  other  words.  Emitter  should  produce 
the  sequence  011011011...  or  110110110...  with  probability 
1/4  and  the  sequence  0110101101...  or  1101011010...  with 
probability  3/4. 
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5.  NONZERO-SUM  GAMES. 

In  the  previous  two  sections  we  dealt  exclusively  with 
zero-sum  games.  That  is,  the  payoff  to  Predictor  was  always 
assumed  to  be  the  negative  amount  of  the  payoff  to  Emitter. 

Let  us  now  modify  the  game  G(c(t),n)  of  the  previous 
section  by  adding  to  its  description  the  assumption: 

(4)  The  payoff  to  Predictor  is  one  if  his  prediction  is  a  correct 
one  and  zero  otherwise. 

We  obtain  a  new  nonzero-sum  game 

G'(c(t),n)  -  ,W2> , 

where  the  strategy  spaces  S,$  and  Emitter's  payoff  function 
are  same  as  in  the  zero-sum  game  (G(c(t),n).  The?  Predictor's 
payoff  function  is  defined  by 

/I  if  ♦(fil")  - 
W2(s,*)  -  j 

lo  if  ♦(u|n)  i 

where  m  is  the  sequence  s£. 

Proposition  5.1:  The  nonzero-sum  game  G'(c(t),n)  has  an  equili- 

*  *  *  * 
brium  point  (x  ,f  ) ,  where  f  is  defined  by  (16)  and  x  is 

*  * 

defined  as  in  Proposition  4.3  with  the  exception  that  £  ■  C  ■  1/2. 

C0  C1 

Furthermore,  all  equilibrium  points  are  equivalent  and  interchangeable 
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yielding  the  minimax  payoffs  v  and  1/2  to  Emitter  and  Predictor 
respectively. 

* 

Proof :  Let  a  be  defined  by  (18).  Then  for  all  f  (  F  by  the 

* 

definition  of  x 

,  *  *  ,  *  *  *  I 

W2(x  ,f)  -  £t  fQ (a  )  +  f^a  )  -  2, 
so  that  trivially 

sup  W* (x*,f)  -  W  (x*,f*)  - 
f€F  L  z 

* 

Since  f  is  a  minlmax  strategy  in  G(c(t),n) 

W^s.f*)  i  v 

for  any  s  €  S. 

*  * 

Now  by  the  definition  of  x  and  f 

A  A  AAA  AAA 

Wx(x  ,f  )  -  c(tQ)Ct  fx(a  )  +  c(t1)Ct  fQ(a  ) 

-  c(tQ)c(t1)lc(t0)  +  dt^)]  1  -  h(a*)  -  v. 

Hence 

sup  W.  (s,f*)  -  Wl(xV)  -  v, 
s€S 

A  A 

which  together  with  (20)  proves  that  (x  ,f  )  is  an  equilibrium 
point  yielding  minimax  payoffs. 


I 


Finally  it  is  easy  to  see  that  no  equilibrium  point  can 
yield  payoff  greater  than  minimax  for  either  of  the  players  since 
this  would  decrease  the  other  player's  payoff  below  his  minimax 
value.  Interchangebility  is  obvious. 

The  last  type  of  an  Emission-Prediction  game  considered  in 
this  paper  is  the  most  general  one  and  is  obtained  by  further 
relaxing  the  restrictions  on  Predictor. 

(1)  Emitter  chooses  a  period  t  •  1,2,.. .  and  a  periodic 
binary  sequence  with  this  period. 

(2)  Predictor  is  allowed  to  observe  as  many  digits  of  the 

sequence  as  he  wishes  and  after  he  decides  to  cease 

;  / 

observation  he  is  required  to  predict  the  next  digit. 

(3)  The  payoff  to  Emitter  is  zero  if  the  prediction  is 
correct  and  is  equal  to  c^(t)  3s  0  if  the  prediction 
is  wrong  and  the  sequence  has  period  t. 

(4)  The  payoff  to  Predictor  is  zero  if  his  prediction  is 
wrong  and  is  equal  to  c^Cn)  2s  0  if  it  is  correct 
and  the  length  of  the  observed  segment  was  n. 

i 

The  constants  c^(t)  and  c^Cn)  represent  penalties  for  long 
periods  and  lengthy  observation.  We  assume  that 

t  -  1,2,...  ■*  1  ■  c1(l)  2s  c1(t)  2s  c1(t+l),  lim  c1(t)  “  0 

"  t-*» 

and 
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n  -  0,1,...  m  1  -  c2(0)  i  c2(n)  *  c2(nfl),  lim  c2<n)  -  0< 

n+® 


!  I 


We  will  denote  the  resulting  nonzero-sum  game  by  G(c^(t), 

c2(n)).  ,  ;  ,  1 

1 

/  The  space  S  of  Emitter's  pure  strategies  In  this  game  Is 

same  as  before.  Predictor's  pure  strategies  €  ¥  ace  pairs 

i  !  i 

i  1  , 

4>  m  (»*♦)» 


where  B  Is  a  finite  set  of  finite  strings  of  zeros  and  ones 

I 

;  ; 

such  that  for  every  w  €  fl  there  Is  some  b  6  B  which  Is  an 

i  1 

Initial  segment  of  u.  4>  Is  a  mapping  frpm  B;  into  {0,1}. 
Clearly,  the  set  V  of  all  Predictor's  pure  strategies  is  now 

i 

countably  Infinite. 

:  i  •. 

i  • 

Let  8  -  (t,8t),  -  (B,$),  w  -  st>  b  €  B  be  the 

initial  segment  of,  u>,  and  let  n  be  the  length  of  b.  The 

•  i  i 

payoffs  are  now  given  by:  < 


fCl 

W1(s ,*)  -  / 

SO 


c,  (t)  if  «Kb)  + 


O'  if  ♦(bi  »  w^, 


w2(s»^j  "  ^ 


c2(n)  if  '♦  (b)  -  w^, 

•  ;  . 

0,  if  *(b)  *  0)^.  ( 


I 
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Proposition  5.2:  The  game  G(c,.  (t)  ,c„  (n))  has  an  equilibrium 
point. 


Proof :  Let  for  j  ■  1,2 


p.  .(s.s')  -  sup  *  W  (s,i|i)  -  W  (s',*)|  and 

J»*  ^0,  J  J 


P,  ,<*»*')  ■  sup|w  (s,l>)  -  W  (s,*') 
s€S  j  J 


be  Wald' 8  (pseudo)  metrics  [8]  on  S  and  $  respectively, 
e  >  0  and 


Let 


S£  -  {(t,st)  e  S  :  cx(t)  Zj)* 


*  -  UB,*)  €  i P  :  c,(|B|)  5tf}, 


where  |B|  is  the  length  of  the  longest  string  b  €  B.  Since 

lim  c.  (t)  ■  lim  c.(n)  ■  0  the  finite  sets  S  and  i|>  are 

t-**  ir** 

e-nets  in  the  (pseudo)  metric  spaces  (S,p^^)  and  (4*  ^2) 
respectively  and  hence  these  spaces  are  totally  bounded  ("condi¬ 
tionally  compact"  in  Wald's  terminology).  By  [8],  Theorem  2.1, 
so  are  then  the  spaces  (Syp^)  and  (<p »P 21) *  Let»  for  ^  "  1*2,..., 

it  it 

(x^,yfc)  be  an  equilibrium  point  in  the  finite  game 

a  * 

Then  there  is  a  subsequence  (x,  ,y,  ),  1  *  1,2,...,  and  a  pair 

it.  it. 

of  mixed  strategies  x  €  X,  y  €  Y  such  that  as  i  -►  •,  j  -  1,2, 


a  a 


a  a. 


p,  .  (x,  ,x  )  -  0  and  p.  7(y,  ,y  )  -►  0.  In  particular  this 

J  j  ,z 

Implies,  as  i  -►  *, 
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I  ,  *  *  .  *  *v  I 

lwi<xfc1*yk±)  ~  wi*x  »y  >1  *  °» 


(21) 


m  1 1 

| sup  W  (x,y  )  -  sup  W.  (x,y  )|  -►  0, 
X€X  x€X 


x€X 


wt  ft 

|sup  W1(s,yk  )  -  sup  Wj^x.y  )|  -►  0. 


1/k. 


*  *  V 

Finally,  since  (x,  ,y.  )  is  an  equilibrium  point  in  G. 

Ki  *1  l 

ft  ft 

sup  W  (x,y  )  -  W  (x.  ,y  ) 

*«i/kl  1  1  1 


(22) 

(23) 


whence  letting  i  in  (21)  through  (23)  we  obtain 


,  ,  *  *v 

sup  W.(x,y  )  -  W.(x  ,y  ). 

x€X 


Similarly, 

,  *  %  ,  *  *v 

sup  W„(x  ,y)  -  W„(x  ,y  ), 

y€Y 

so  that  (x*,y*)  is  an  equilibrium  point  in  G(c^(t)  ^(n))  and 
the  proposition  is  proved. 


Remark :  We  suspect  that  much  stronger  statements,  namely  that 
all  equilibrium  points  are  equivalent  and  interchangeable,  can 
be  proved,  but  we  have  not  been  able  to  prove  this. 
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